Using Metadata Analysis and Base Analysis Techniques in Data Qualities Framework for Data Warehouses

نویسندگان

  • Azwa Abdul Aziz
  • Md Yazid
  • Mohd Saman
  • Mohd Pouzi Hamzah
چکیده

Information provided by any applications systems in organization is vital in order to obtain a decision. Due to this factor, the quality of data provided by Data Warehouse (DW) is really important for organization to produce the best solution for their company to move forwards. DW is complex systems that have to deliver highly-aggregated, high quality data from heterogeneous sources to decision makers. It involves a lot of integration of sources system to support business operations. Problem statement: Many of DW projects are failed because of Data Quality (DQ) problems. DQ issues become a major concern over decade. Approach: This study proposes a framework for implementing DQ in DW system architecture using Metadata Analysis Technique and Base Analysis Technique. Those techniques perform comparison between target values and current values gain from the systems. A prototype using PHP is develops to support Base Analysis Techniques. Then a sample schema from Oracle database is used to study differences between applying the framework or not. The prototype is demonstrated to the selected organizations to identify whether it will help to reduce DQ problems. Questionnaires have been given to respondents. Results: The result show user interested in applying DQ processes in their organizations. Conclusion/Recommendation: The implementation of the framework suggested in real situation need to be conducted to obtain more accurate result.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Data and Methods for the Production of National Population Estimates: An Overview and Analysis of Available Metadata

Thomas Spoorenberg Translated by: Elham Fathi Statistical Center of Iran Abstract. Official population estimates can be produced using a variety of data sources and methods. These range from the direct extraction of information from continuously updated population registers to procedures for updating the status of a population enumerated previously in a periodic census. Additional sources and ...

متن کامل

Customer Behavior Mining Framework (CBMF) using clustering and classification techniques

The present study proposes a Customer Behavior Mining Framework on the basis of data mining techniques in a telecom company. This framework takes into account the customers’ behavior patterns and predicts the way they may act in the future. Firstly, clustering technique is used to implement portfolio analysis and previous customers are divided based on socio-demographic features using k</em...

متن کامل

A Conceptual Metadata Framework for Spatial Data Warehouse

Metadata represents the information about data to be stored in Data Warehouses. It is a mandatory element of Data Warehouse to build an efficient Data Warehouse. Metadata helps in data integration, lineage, data quality and populating transformed data into data warehouse. Spatial data warehouses are based on spatial data mostly collected from Geographical Information Systems (GIS) and the trans...

متن کامل

Bank branches efficiency assessment using dynamic data envelopment analysis approach to SBM

A new approach or model to the dynamic DEA, referred to as the adjusted dynamic DEA, is proposed in this study. Adjusted dynamic DEA optimizes the production activity of DMUs by introducing adjustment variables to modify the interconnecting activities between consecutive terms, solving conflicts that arise between terms and between management and shareholders. The non-oriented SBM model is used...

متن کامل

Data Analysis Strategy for Revealing Multivariate Structures in Social-Economic Data Warehouses

This research work is aimed at the development of data analysis strategy in a complex, multidimensional, and dynamic domain. Our universe of discourse is concerned with the data mining techniques of data warehouses revealing the importance of multivariate structures of social-economic data which influence criminality. Distinct tasks require different data structures and various data mining exer...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2011